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Effects of variations of load distribution on network 

performance 

David Arrowsmithl, Mario di Bernardo*”, Francesco Sorrentino”! 


Abstract —This paper is concerned with the characteriza¬ 
tion of the relationship between topology and traffic dynam¬ 
ics. We use a model of network generation that allows the 
transition from random to scale free networks. Specifically, 
we consider three different topological types of network: 
random, scale-free with 7 = 3, scale-free with 7 = 2. By using 
a novel LRD traffic generator, we observe best performance, 
in terms of transmission rates and delivered packets, in the 
case of random networks. We show that, even if scale-free 
networks are characterized by shorter characteristic-path- 
length (the lower the exponent, the lower the path-length), 
they show worst performances in terms of communication. 
We conjecture this could be explained in terms of changes in 
the load distribution, defined here as the number of shortest 
paths going through a given vertex. In fact, that distribu¬ 
tion is characterized by (i) a decreasing mean (ii) an increas¬ 
ing standard deviation, as the networks becomes scale-free 
(especially scale-free networks with low exponents). The use 
of a degree-independent server also discriminates against a 
scale-free structure. As a result, since the model is un¬ 
controlled, most packets will go through the same vertices, 
favoring the onset of congestion. 

I. Introduction 

Much research effort has been spent recently in under¬ 
standing the relationship between network topological fea¬ 
tures and communication performances. In the prob¬ 
lem of relating the degree distribution (i.e. the distribution 
of the number of incident links at a given node) to the load 
distribution (that is the number of shortest paths passing 
through a given vertex) in a given network is discussed. 
The load parameter is shown to be useful to give a sta¬ 
tistical measure of the probability that a generic packet, 
travelling in the network, will pass through a given ver¬ 
tex. Nevertheless, it is not taken into account that, in real 
world applications, packets are stored in routers’ queues 
while going from origin to destination, causing time delays 
in the communication. 

As a first approximation, it would be natural to make the 
most general hypothesis about the structure of the underly¬ 
ing network, that is, to think of it as a random graph. Un¬ 
fortunately, real networks show statistical properties that 
are far from being completely random. The most impor¬ 
tant difference is that they have typically power law degree 
distributions with exponents between 2 and 3 [2] . For that 
reason here we have considered three different topologies, 
in the order: random, scale-free with 7 = 3, scale-free with 
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7 = 2. 

In this paper, we will consider a packet transport model 
that has been widely studied in the literature (see m, 
1^ for further details), in order to compare the main indi¬ 
cators of the network performance, specifically the delivery 
time and the number of delivered packets (or throughput), 
as the underlying topology is varied. 

II. From Random to Scale-Free Networks 

As a first general approximation we can think of the 
underlying network to be represented by a random graph 
(cf. the well-known model by Erdos and Renyi presented 
in 1^). A network with N vertices and M edges is built 
as follows: we select with uniform probability two of the 
N possible vertices and link them unless they are either 
already connected or self-links are generated. We repeat 
this iteration M times. 

While the ER graph is pioneering, various properties of 
this model are not in accordance with those of complex net¬ 
works recently discovered in the real world. For example, 
the distribution of the number of edges incident on each 
vertex, called the degree distribution^ is Poissonian for the 
ER graph, while it follows a power law decay with increas¬ 
ing degree for many real world networks, called scale free 
(SF) networks [7], [S], 12) I2' That is real networks differ 
deeply from ER networks since they are characterized by a 
power-law degree distribution. 

In order to cause the transition from random to scale-free 
network we use the static model recently introduced in pQ . 
Vertices are indexed by an integer i, for [i = 1...., N), and 
assigned a weight or fitness pi = i~°‘ where a is a parameter 
between 0 and 1. Two different vertices are selected with 
probabilities equal to the normalized weights, Pi/J2kPk 
and Pj/^kPk respectively and an edge is added between 
them unless one exists already. This process is repeated 
until M edges are made in the system leading to the mean 
degree (fc) = 2M/N. This results in the expected degree at 
vertex i scaling as ^ (^)“ p. We then have the degree 
distribution, i.e. the probability of a vertex being of degree 
k, given by P{k) ~ with 7 = 1 -|- A Thus, by varying 
a, we can obtain the exponent 7 in the range, 2 < 7 < 00 . 
Moreover the ER graph is generated by taking a = 0. 

It is worth noting that the static model described here, 
can be considered as an extension of the standard ER model 
for generating random-scale free networks, i.e. networks 
with prescribed degree distribution, but completely ran¬ 
dom with respect to all the other features. 
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III. Load distribution in Networks 

One of the main parameters of vertex centrality is the 
betweenness centrality defined as the number of shortest 
paths between pairs of nodes crossing a given vertex m- 
With this index as a starting point, Goh et al. pQ ^T], de¬ 
fined the load at each vertex v, say l{v), as the number of 
packets passing through it, under the assumption that ev¬ 
ery node sends a packet to every other node in the network 
and that packets move in parallel from origin to destina¬ 
tion through the geodesic, i.e. the shortest path between 
them. This implies that for each shortest path between 
a given couple of vertices, there is a packet passing along 
it; in the case that packets encounter a branching point at 
which there is more than one shortest path toward the des¬ 
tination, they would be divided by the number of branches 
at the branching point. Thus, in ^ the load at each vertex, 
is defined as: 

iM = E— (1) 

S^t 

where <;st{v) is the number of shortest paths going from s 
to t through V and Cst is the total number of shortest paths 
between s and t. 

Moreover, we measure here another parameter to char¬ 
acterize further the uniformity of the load distribution: the 
load standard deviation a{l) over the vertices of a given net¬ 
work. Here, to make it insensitive to the average values of 
I, we will evaluate ^{l), the standard deviation of the load 
appropriately normalized with respect to its mean. Specif¬ 
ically, a high variance of that distribution should indicate 
an unfair use of the network, and could therefore indicate a 
possible cause for congestion. Thus the load standard devi¬ 
ation gives a measure of how the network topology can lead 
itself to a fair exploitation of the network vertices. Note 
that from a communications point of view it could be de¬ 
sirable to minimize a{l) in order to make the exploitation 
of the network resources as fair as possible. 

We have computed the load distribution by varying the 
a parameter. In Fig.l the average load is reported while 
in Fig.2 we depict its standard deviation. Note that for 
a = 0, the resulting topology is the standard ER network; 
for a = 0.5, it is a scale-free with 7 = 3; for a = 1 it is a 
scale-free with 7 = 2. 

As the network transition from random to scale-free oc¬ 
curs, we observe that the average load decreases, since the 
presence of hubs results in a shortening of the mean dis¬ 
tances between vertices. Nevertheless this happens at the 
expense of the fairness of the network resources exploita¬ 
tion. Evidence of this is shown in Fig.2, where the load 
standard deviation increases with a. This indicates that 
relatively few vertices are drawing most of the network 
load. 

It needs to be stressed that the load behavior fails in 
describing real communication on networks, since, ideally, 
packets are supposed to travel from sources to destinations 
directly, without having to be stored along their pathway 
in the nodes queue. That is equivalent to assuming that 
node queues have an infinite transmission rate. 



Fig. 1 

The average load at the vertices is given with respect to the 
parameter o;, while keeping the number of vertices fixed (at 500) 
and of the edge degree (3 per node). 


Nevertheless we will show here that the network com¬ 
munication behavior is actually affected by the load dis¬ 
tribution, since the load parameter gives the probability 
for a generic packet to be forwarded to a given node. In 
order to better characterise how the load distribution af¬ 
fects the network performance, we need first to choose an 
appropriate traffic generation model. 

IV. Model of Network Data Traffic 

We use the family of Erramilli interval maps as the 
generator for each LRD traffic source, (Erramilli et al., 
1994), |T2j within the network. The maps are given by 
/ = /(mi.ms.d) : ^ ^ = [0,1], where: 

x + {l-d){x/d)'^\ xe[0,d], 

X - d{{l - x)/{l — d))"'^ , X € {d,l], 

where d S (0,1). The map / is iterated to produce an orbit, 
or sequence, of real numbers Xn G [0,1] which is then con¬ 
verted into a binary Off- On sequence where the n-th value 
is ’Off’ if Xn G [0, d], and ’On’ if Xn G (d, 1]. If the map 
is in the ’On’ state, each iteration of the map represents a 
packet generated. The parameters mi, m 2 G [3/2, 2] induce 
map intermittency. When mi = m 2 = 1.5 we have short 
range dependent binary output and this becomes fully long 
range dependent binary output for Max{mi,m 2 } = 2.0. 

The network involves two types of nodes: hosts and 
routers. The first are nodes that can generate and receive 
messages and the second can only store and forward mes¬ 
sages. The density of hosts p G [0,1] is the ratio between 
the number of hosts and the total number of nodes in the 
network (in this paper we take p = 0.16). Hosts are ran¬ 
domly distributed throughout the network. 
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Fig. 2 

Normalized STD of vertex load o-(/) is reported as varying the a 
parameter, under the same hypothesis of figure 2. 


A routing algorithm is needed to model the dynamic as- 
pects of the network. Packets are created at hosts and sent 
through the lattice one step at a time until they reach their 
destination host. 

The routing algorithm operates as follows: 

(1) First a host creates a packet following a distribution 
defined by a chaotic map (LRD), as described above. If a 
packet is generated it is put at the end of the queue for 
that host. This is repeated for each host in the lattice. 

(2) Packets at the head of each queue are picked up and 
sent to a neighboring node selected according to the follow¬ 
ing rules, (a) A neighbor closest to the destination node 
is selected, (b) If more than one neighbor is at the mini¬ 
mum distance from the destination, the link through which 
the smallest number of packets have been forwarded is se¬ 
lected. (c) If more than one of these links shares the same 
minimum number of packets forwarded, then a random se¬ 
lection is made. 

This process is repeated for each node in the lattice. 
The whole procedure of packet generation and movement 
represents one time step of the simulation. 

V. Effects on network performance of varying 

THE UNDERLYING TOPOLOGY 

We have compared three different topologies: random, 
scale-free with 7 = 3, scale-free with 7 = 2 in order to 
evaluate the effects of the underlying topology on the net¬ 
work performances. 

The networks we consider have different degree distribu¬ 
tions but are characterized by the same number of available 
resources, that is by the same number of vertices and edges. 
For the only case where the resulting network is not fully 
connected, we have only considered the giant component. 
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Fig. 3 

Number of delivered packets versus the generation rate, A. Three 
different networks, random, scale-free with 7 = 3 , scale-free with 
7 = 2, have been compared, while keeping a fixed number of vertices 
(500) and the edge degree (3 per node). 


In Fig.3 the number of delivered packets, or throughput, 
has been plotted as a function of the generation rate, A, 
for the three considered topologies. Scale-free networks 
show the least effective performances in that the number of 
delivered packets is lower than for random networks, with 
Poisson degree distribution. For scale-free networks, those 
with 7 = 2 are still less effective than those with 7 = 3. 
Though our analysis is purely qualitative, we would like 
to point out that the real Internet has a power-law degree 
distribution with 7 = 2.2 [^. 

Notice that the differences among the different consid¬ 
ered topologies, increase for higher values of A. In partic¬ 
ular random networks seem to behave better than other 
networks under high traffic rates. It is worth noting that 
this is in strong agreement with results shown in m 

In Fig.4, the delivery time for packets to reach their des¬ 
tination has been plotted versus the generation rate, A. 
The results are in accordance with those for throughput: 
the highest delivery time have been achieved for random 
networks, the lowest for scale-free networks with 7 = 2. 

The reason for this is that packets that are stored in 
the routers’ queues without being delivered to their desti¬ 
nation, increase the time needed for other packets to reach 
their destination. Moreover scale-free networks show a van¬ 
ishing value of the critical load A, i.e. the value of A at 
which a phase-transition occurs |3], with respect to ran¬ 
dom graphs. 

Consequently, although scale-free networks are charac¬ 
terized by a shorter characteristic-path-length d, they 
show worst performances in terms of communication. We 
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Fig. 4 

Delivery times versus the generation rate, A. Three different 
networks, random, scale-free with 7 = 3 , scale-free with 7 = 2 , have 
been compared, while keeping fixed the number of vertices (500) 
and the edges (3 per node) 


conjecture this could be explained in terms of load distribu¬ 
tion. In fact, as we have already observed, that distribution 
is characterized by (i) a decreasing mean (ii) an increasing 
standard deviation, as the networks becomes scale-free (es¬ 
pecially scale-free networks with low degree distribution 
exponents). As a result, since the model is uncontrolled, 
most packets will go through the bottle-neck vertices, lo¬ 
calizing the jamming in some zone of the network, further 
inducing the onset of congestion. 

VI. Conclusions 

We have shown how topological transitions in a given 
network from random to scale free affect the load distribu¬ 
tion on the network itself. In particular, we characterised 
such load distribution in terms of the average load and its 
standard deviation. We observed that as the topological 
transition takes place, the network performance worsens 
and the load tends to become more localised (higher stan¬ 
dard deviation). 

Using a novel LRD traffic generation model, we then 
characterised the effects of localisation of the load in terms 
of the typical parameters used to measure performance of 
traffic on the network; namely the number of delivered 
packets and the average delivery time. 
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